AITopics | calibration measure

calibration measure

d4cbcae8cfc8aa3ae897a1296e4e0cac-Paper-Conference.pdf

Neural Information Processing Systems | Feb-18-2026, 07:01:01 GMT

artificial intelligence, calibration measure, machine learning, (17 more...)

Country: North America > United States > California > Alameda County > Berkeley (0.04)

Genre: Research Report > Experimental Study (0.92)

Industry: Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science (0.67)

d20e3c35bedb17a9f6f01fc434a30fa3-Paper-Conference.pdf

Neural Information Processing Systems | Feb-18-2026, 06:21:48 GMT

artificial intelligence, calibration, machine learning, (19 more...)

Country:

North America > United States > Texas > Travis County > Austin (0.04)
Europe > France (0.04)
North America > United States > Virginia (0.04)
(5 more...)

Genre:

Research Report > Experimental Study (0.92)
Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Data Science (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

1c336b8080f82bcc2cd2499b4c57261d-Paper.pdf

Neural Information Processing Systems | Feb-11-2026, 15:03:23 GMT

calibration, calibration error, estimator, (16 more...)

Country:

North America > United States > New York (0.04)
Europe > Sweden > Uppsala County > Uppsala (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(2 more...)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

X-CAL: Explicit Calibration for Survival Analysis

Mark Goldstein

Neural Information Processing Systems | Feb-10-2026, 13:45:06 GMT

A model is called well-calibrated when its predicted number of events within any time interval is similar to the observed number. A survival model's calibration can be measured using, for instance, distributional calibration (

artificial intelligence, calibration, machine learning, (18 more...)

Country:

North America > Canada (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report (0.47)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Calibration tests in multi-class classification: A unifying framework

Neural Information Processing Systems | Dec-25-2025, 02:12:12 GMT

In safety-critical applications a probabilistic model is usually required to be calibrated, i.e., to capture the uncertainty of its predictions accurately. In multi-class classification, calibration of the most confident predictions only is often not sufficient. We propose and study calibration measures for multi-class classification that generalize existing measures such as the expected calibration error, the maximum calibration error, and the maximum mean calibration error. We propose and evaluate empirically different consistent and unbiased estimators for a specific class of measures based on matrix-valued kernels. Importantly, these estimators can be interpreted as test statistics associated with well-defined bounds and approximations of the p-value under the null hypothesis that the model is calibrated, significantly improving the interpretability of calibration measures, which otherwise lack any meaningful unit or scale.
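
For reference, the expected and maximum calibration errors named in the abstract are commonly estimated by binning top-label confidences. The following is a minimal sketch of that standard binned estimator, not the kernel-based estimators the paper proposes; the function name, the equal-width binning, and the default of 10 bins are assumptions made for illustration.

```python
from typing import List, Sequence, Tuple


def binned_calibration_errors(
    confidences: Sequence[float],
    correct: Sequence[bool],
    n_bins: int = 10,
) -> Tuple[float, float]:
    """Return (ECE, MCE) for top-label confidences using equal-width bins."""
    if len(confidences) != len(correct) or not confidences:
        raise ValueError("inputs must be non-empty and of equal length")
    # Per-bin accumulators: [sum of confidence, number correct, count].
    bins: List[List[float]] = [[0.0, 0.0, 0.0] for _ in range(n_bins)]
    for c, ok in zip(confidences, correct):
        # Clamp so that a confidence of exactly 1.0 falls in the last bin.
        idx = min(int(c * n_bins), n_bins - 1)
        bins[idx][0] += c
        bins[idx][1] += 1.0 if ok else 0.0
        bins[idx][2] += 1.0
    n = float(len(confidences))
    ece, mce = 0.0, 0.0
    for conf_sum, hits, count in bins:
        if count == 0:
            continue  # empty bins contribute nothing
        gap = abs(hits / count - conf_sum / count)
        ece += (count / n) * gap  # gap weighted by bin frequency
        mce = max(mce, gap)       # worst gap over non-empty bins
    return ece, mce


if __name__ == "__main__":
    # Four predictions at confidence 0.9, three of them correct.
    print(binned_calibration_errors([0.9] * 4, [True, True, True, False]))
```

Because this estimator looks only at the most confident prediction, it illustrates the limitation the abstract points out: a model can score well here while its full probability vectors are miscalibrated.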

calibration test, multi-class classification, name change, (5 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.69)

Truthfulness of Calibration Measures

Neural Information Processing Systems | Oct-10-2025, 17:45:03 GMT

We study calibration measures in a sequential prediction setup.

artificial intelligence, calibration measure, machine learning, (17 more...)

Country: North America > United States > California > Alameda County > Berkeley (0.04)

Genre: Research Report > Experimental Study (0.92)

Industry: Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science (0.67)

Testing Calibration in Nearly-Linear Time

Neural Information Processing Systems | Oct-10-2025, 17:29:10 GMT

Probabilistic predictions are at the heart of modern data science.

artificial intelligence, calibration, machine learning, (19 more...)

Country:

North America > United States > Texas > Travis County > Austin (0.04)
Europe > France (0.04)
North America > United States > Virginia (0.04)
(5 more...)

Genre:

Research Report > Experimental Study (0.92)
Research Report > New Finding (0.67)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Making and Evaluating Calibrated Forecasts

Lu, Yuxuan, Wu, Yifan, Hartline, Jason, Hu, Lunjia

arXiv.org Machine Learning | Oct-9-2025

Calibrated predictions can be reliably interpreted as probabilities. An important step towards achieving better calibration is to design an appropriate calibration measure to meaningfully assess the miscalibration level of a predictor. A recent line of work initiated by Haghtalab et al. [2024] studies the design of truthful calibration measures: a truthful measure is minimized when a predictor outputs the true probabilities, whereas a non-truthful measure incentivizes the predictor to lie so as to appear more calibrated. All previous calibration measures were non-truthful until Hartline et al. [2025] introduced the first perfectly truthful calibration measures for binary prediction tasks in the batch setting. We introduce a perfectly truthful calibration measure for multi-class prediction tasks, generalizing the work of Hartline et al. [2025] beyond binary prediction. We study common methods of extending calibration measures from binary to multi-class prediction and identify ones that do or do not preserve truthfulness. In addition to truthfulness, we mathematically prove and empirically verify that our calibration measure exhibits superior robustness: it robustly preserves the ordering between dominant and dominated predictors, regardless of the choice of hyperparameters (bin sizes). This result addresses the non-robustness issue of binned ECE, which has been observed repeatedly in prior work.
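
The bin-size sensitivity of binned ECE mentioned in the abstract can be seen on a toy example. This is a minimal sketch assuming binary outcomes and equal-width bins; the function name and the data are hypothetical, and the code implements ordinary binned ECE, not the truthful measure introduced in the paper.

```python
from typing import Sequence


def binned_ece(preds: Sequence[float], outcomes: Sequence[int], n_bins: int) -> float:
    """Binned ECE for binary outcomes with equal-width bins on [0, 1]."""
    # Per-bin accumulators: [sum of predictions, sum of outcomes, count].
    sums = [[0.0, 0.0, 0] for _ in range(n_bins)]
    for p, y in zip(preds, outcomes):
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 goes to the last bin
        sums[idx][0] += p
        sums[idx][1] += y
        sums[idx][2] += 1
    n = len(preds)
    # (count / n) * |mean pred - mean outcome| simplifies to |sum_p - sum_y| / n.
    return sum(abs(sp - sy) / n for sp, sy, cnt in sums if cnt)


if __name__ == "__main__":
    # Hypothetical predictor: says 0.4 when the outcome is 0 and 0.6 when it is 1.
    preds = [0.4] * 50 + [0.6] * 50
    outcomes = [0] * 50 + [1] * 50
    for k in (1, 2, 10):
        print(k, round(binned_ece(preds, outcomes, k), 6))
```

With a single bin the two groups average out and the measured error is approximately 0, while with 2 or 10 bins the same predictions score approximately 0.4, so the reported miscalibration depends on a hyperparameter rather than on the predictor alone.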

calibration measure, classwise, predictor, (16 more...)

2510.06388

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

1c336b8080f82bcc2cd2499b4c57261d-Paper.pdf

Neural Information Processing Systems | Oct-2-2025, 07:06:24 GMT

artificial intelligence, calibration error, machine learning, (18 more...)

Country:

Europe > Sweden > Uppsala County > Uppsala (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Sweden > Östergötland County > Linköping (0.04)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Calibration through the Lens of Indistinguishability

Gopalan, Parikshit, Hu, Lunjia

arXiv.org Machine Learning | Sep-3-2025

Calibration is a classical notion from the forecasting literature which aims to address the question: how should predicted probabilities be interpreted? In a world where we only get to observe (discrete) outcomes, how should we evaluate a predictor that hypothesizes (continuous) probabilities over possible outcomes? The study of calibration has seen a surge of recent interest, given the ubiquity of probabilistic predictions in machine learning. This survey describes recent work on the foundational questions of how to define and measure calibration error, and what these measures mean for downstream decision makers who wish to use the predictions to make decisions. A unifying viewpoint that emerges is that of calibration as a form of indistinguishability, between the world hypothesized by the predictor and the real world (governed by nature or the Bayes optimal predictor). In this view, various calibration measures quantify the extent to which the two worlds can be told apart by certain classes of distinguishers or statistical measures.
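
The indistinguishability view can be illustrated with a small sketch. One common formalization in this literature treats a distinguisher as a bounded function w of the prediction and measures its advantage as the empirical mean of w(p) * (y - p); the calibration error is then the largest advantage over a class of distinguishers. The finite family used below is a hypothetical choice for illustration, and the function name is not taken from the survey.

```python
from typing import Callable, Sequence


def weighted_calibration_error(
    preds: Sequence[float],
    outcomes: Sequence[int],
    distinguishers: Sequence[Callable[[float], float]],
) -> float:
    """Largest |mean of w(p) * (y - p)| over the given distinguishers w."""
    n = len(preds)
    worst = 0.0
    for w in distinguishers:
        # Advantage of w in telling real outcomes apart from outcomes
        # that would be drawn according to the predictions themselves.
        advantage = sum(w(p) * (y - p) for p, y in zip(preds, outcomes)) / n
        worst = max(worst, abs(advantage))
    return worst


if __name__ == "__main__":
    # Hypothetical distinguisher family, each mapping [0, 1] into [-1, 1].
    family = [lambda p: 1.0, lambda p: 2.0 * p - 1.0]
    preds = [0.4] * 50 + [0.6] * 50
    outcomes = [0] * 50 + [1] * 50
    print(weighted_calibration_error(preds, outcomes, family))
```

In this toy data the constant distinguisher sees no difference between the two worlds, because predictions and outcomes have the same mean, while the linear distinguisher detects that low predictions are too high and high predictions are too low. Richer distinguisher classes give stricter calibration measures.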

artificial intelligence, calibration, machine learning, (17 more...)

2509.02279

Country: